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The random greedy algorithm for sum-free subsets of Z2 n 


Patrick Bennett* 


Abstract 

S C Z2 n is said to be sum-free if S has no solution to the equation a + b = c. The 
sum-free process on Z271 starts with S := 0 , and iteratively inserts elements of Z2 n , 
where each inserted element is chosen uniformly at random from the set of all elements 
that could be inserted while maintaining that S is sum-free. We prove a lower bound 
(which holds with high probability) on the final size of S, which matches a more general 
result of Bennett and Bohman (ED, and also matches the order of a sharp threshold 
result proved by Balogli, Morris and Samotij (ID)- We also show that the set S produced 
by the process has a particular non-pseudorandom property, which is in contrast with 
several known results about the random greedy independent set process on hypergraphs. 


1 Introduction 

Let H be a hypergraph. A set of vertices S C V ( H ) is called independent if S contains no 
edge of H. The random greedy algorithm for independent sets starts with 5 = 0, and then 
randomly inserts elements into 5 so long as 5 remains independent. Specifically, at the i th 
step, we put 5 := 5 U {uj} where v-i is chosen uniformly at random from all vertices v such 
that 5 U {u} is independent (we halt when there are no such vertices v). The algorithm 
terminates with a maximal independent set. 

One notable instance of this algorithm is the H-iiee process for any fixed fc-uniform 
hypergraph H (where k > 2 so H might be a graph). The process starts with an empty 
hypergraph on vertex set [n], and iteratively inserts randomly chosen hyperedges so long as 
we never create a subhypergraph isomorphic to H. The H-iree process can be viewed as an 
instance of the random greedy independent set algorithm running on a hypergraph T-L with 
vertex set (^), where each edge of the hypergraph H corresponds to a subset of (^) forming 
a copy of H. 

Bohman [3] analyzed the H- free process when H is the graph A 3 , determining (up to 
a constant) how long the process lasts , and bounding the independence number of the final 
graph formed. Bohman’s analysis of the process included information about the independence 
number of the graph produced, and gave a second proof that the Ramsey number i?(3, t ) is 
Q, • In the same paper, Bohman considered the Aj-free process and improved the best 

known lower bound on i?(4,f). Bohman and Keevash 5] went on to analyze the A-free 
process for many other graphs A, resulting in new lower bounds on R(s,t) for fixed s > 5, 
and new lower bounds on the Turan numbers of certain bipartite graphs. More recently, 
Bohman and Keevash | 6 ], and independently Fiz Pontiveros, Griffiths, and Morris | 8 ] proved 

that the A 3 -free process terminates with + o(l)^ log 1 / 2 n ■ n 3 / 2 edges (and also gave 
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better bounds on the indpendence number of the graph produced by the K^-free process). 
We make a conjecture that would generalize this result. With Bohman, the author in [2 
consider the independent set process on a general class of hypergraphs, and proved a lower 
bound (on the size of the final independent set) which generalizes many of the known lower 
bounds for specific hypergraphs. The general bound in [2] applies to some instances of the 
A/-free process where if is a hypergraph, as well as the fc-AP-free process (which chooses 
elements of Z n while avoiding a /c-term arithmetic progression). 


Theorem 1.1. Let r and e > 0 be fixed. Let Li be a r-uniform, D-regular hypergraph on N 
vertices such that D > N e . Define the degree of a set A C V(LL) to be the number of edges of 
LL that contain A. For a = 2,..., r — 1 we define A a (LL) to be the maximum degree of A over 
A £ (^). We also define the 6 -codegree of a pair of distinct vertices v, v' to be the number of 
edges e, e' £ LL such that v £ e \ e!, v' £ e! \ e and \e fl e!\ = b. We let T^iLL) be the maximum 
b-codegree ofLL. 

If 

A e{H)<D^- £ for £ = 2,... ,r — l (1) 

and 

r r _i(H) < d 1 - e (2) 

then the random greedy independent set algorithm produces an independent set S in LI with 


|Sj = fl IV • 



( 3 ) 


with probability 1 — exp {— iV 0 ^ 1 )}. 

The indpendent sets produced by the algorithm tend to have pseudorandom properties. 
For example, the K^-fiee process produces a graph whose independence number is roughly 
the same as it would be in a random graph with the same edge density m , 0 , 0 ), and for 
any fixed i< 3 -free graph G, the number of copies of G in the graph produced by the AVfree 
process is roughly the same as it would be in a random graph with the same edge density 
(see [ 10 ]). [ 2 ] also has a pseudorandom type result for the independent sets produced by the 
algorithm, which generalized Wolfovitz’s result and which they used to bound the Gowers 
norm of the set produced by the fc-AP-free process. We state this result from 0 now. 

Theorem 1.2. Fix s and a s-uniform hypergraph Q on vertex set V(LL) (i..e the same 
vertex set as the hypergraph LL). We let Xg be the number of edges in Q that are contained 
in the independent set produced at the i th step of the random greedy process on LL. Set 
p = p{i) = i/N and let f max be the lower bound 0) on the size of the independent set given 
by the random greedy algorithm given in Theorem li.il If no edge of Q contains an edge of 
LL, i < i max is fixed, \G\p s —> oo and A a (t 7 ) = o(p a \Q\) for a = 1 ,..., s — 1 then 

Xg = \g\p s (l+o(l)). 


with high probability. 

This paper addresses the sum-free process. In this process we look for a set S C Z 2n 
such that S has no solutions to the equation a + b = c. Define our edge set E to be the family 
of all solutions {a, 6 , c} to a + b = c (Such edges {a, 6 , c} are called Schur triples). Note that 
{a, 6 , c} may have 1,2, or 3 distinct elements. We write the generic form {a, 6 , c} with the 
understanding that if, say, 6 = c then we mean the set {a, 6 } and not a multiset with two 
copies of 6 . Thus, S is sum-free if and only if S is an independent set in the hypergraph LL 
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with vertex set Z 2n and edge set E. TL is not uniform but almost all of the edges have size 
3. Indeed, each vertex v / 0 is in 0(1) edges of size 2 and no edge of size 1. 

Theorem o cannot be applied directly to the sum-free process, since for example hi is 
not quite uniform, and also not quite regular. However it is nearly uniform and regular, and 
these issues alone would not present much difficulty in the analysis. There is a more important 
way in which H fails to satisfy the hypotheses of Theorem 11.11 the codegree condition ©. 
Consider the vertices v and — v for v ^ 0, n. Each of v, —v is in about D = 0(n) edges, but 
the 2-codegree of v. —v is also 0(?r) since whenever we have the equation v + b = c we also 
have the equation — v + c = b. Thus, one of the points of interest in this paper is to see how 
to overcome this issue. 

The largest sum-free subset of Z 2n is the set consisting of the odd numbers. Indeed, 

/, \l/2 

Balogh, Morris, and Samotij in |T proved that p = (j is a sharp threshold above 
which we have the following: if G p is a random subset of Z 2 „ where each element is included 
with probability p independently, then w.h.p. the largest sum-free subset of G p is simply the 
set of its odd elements. Roughly speaking, the fact that odd + odd = even imposes a very 
specific structure on maximum sum-free subsets of random sets in Z 2n . However, this special 
structure ceases to be relevant for very sparse random sets in Z 2n . 

Our main theorem is as follows: 


Theorem 1.3. 

at least 


With high probability, the sum-free process run on Z 2n produces a set of size 


, ._ 1 (, on ioglogn 
®o •— /— ( 1 20 • 

V 3 V logn 


n 2 log 2 n 


We conjecture that a matching (up to a constant factor) upper bound holds, since 
intuitively the Sum-free process should not produce sets that have density larger than the 
threshold in [T], 

Another point of interest in this paper is to demonstrate a non-pseudorandom statistic 
of the set 5 produced by the sum-free process (in contrast to all the pseudorandom results 
concerning other instances of the hypergraph independent set process). It turns out that the 
set 5 tends to contain a large number of pairs v, —v, much larger than one would expect 
in a (uniformly chosen) random set of size |5|. Indeed, if 151 = 0(n^ log 2 n) were chosen 
uniformly at random, then we would expect 5 to contain 0(logn) pairs v,—v. However, as 
we will see in the last section of the paper, the number of such pairs in 5 actually grows like 
a power of n. 

Our analysis of the sum-free process on Z 2n easily extends to Z 2n+ i. Note that Z 2n+ i 
also has linear-sized sum-free subsets (e.g. the set of all odd numbers less than n ). However, 
since there is no result analogous to pp for Z 2rl+ i, we work in Z 2ra . 


2 The Algorithm and Associated Random Variables 

The random greedy algorithm for sum-free sets starts with a sum-free set 5(0) := 0, and a set 
Q(0) of elements that may be inserted into the 5(0) without spoiling the sum-free property. 
Since 0 + 0 = 0, the element 0 cannot be part of any sum-free set. Any singleton except 
for {0} is sunr-free, so Q( 0) = Z 2n \ {0}. At step i, the algorithm selects an element s(z) 
uniformly at random from Q(i ) and puts S(i + 1) := S(i) U {s(z)}. The algorithm then 
determines Q(i + 1), the set of elements that could potentially be inserted in 5(z+l) without 
spoiling the sunr-free property. If Q(i + 1) is empty, the algorithm terminates with a maximal 
sum-free set. 
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We call the elements of S(i) chosen , the elements of Q(i) open , and all other elements 
of Z 2 n closed. As we noted above, at the start of the algorithm 0 is closed and every other 
element is open. 

For k = 2, 3 we define Ek(i) to be the set of edges e C Q(i)uS(i) such that |eflQ(z)| = k. 
We will also define several more random variables which (more or less) represent the degrees 
of vertices. We will partition the set of edges containing v according to what role v plays in 
the corresponding equation a + b = c. For v £ Z 2 „ \S(i)\ {0} and k = 1,2,3 we define the 
random variables D kiL (v,i), the set of edges e G E such that v G e, e \ {u} C Q(i) U S(i), 
\e D Q(i) \ {u}| = k — 1, and v appears on the left of the equation corresponding to e (i.e. v 
plays the role of a or b in the eciuation a + b = c); and for v G h 2n \S(i), we define D k ji{v, i), 
the set of edges e G E such that v G e, e \ {u} C Q(i) U S(i), \e fl Q(i) \ {u}| = k — 1 and v is 
on the right side of the eciuation. We also define the random variable 

D 2 {v, i) := {q G Q(i) : q ± v, 3 e G D 2 ,l(v, i) U D 2 ,r{v, i).q G e}. 

Many of the variables we just defined take two arguments (a vertex v and the step z). Some¬ 
times for short hand we will suppress the i. 


3 Heuristically anticipating the trajectories 

We use some heuristics to anticipate the likely values of the random variables throughout the 
process. Intuitively, we will assume that certain aspects of the hypergraph 'H(i) are the same 
as what they would be if S(i) were a set of i uniformly chosen random elements (rather than 
having been chosen to satisfy the sum-free property). 

Consider a vertex v ^ 0, and let’s estimate the number of ways v is closed by i random 
elements, v is in about 3 n edges of 7i , and nearly all of those edges have size 3. Thus, we 
estimate the number of those edges {v, x, y} such that both x and y have been chosen as 



where we define 

t = t(i) := n _1/,2 z 

. We heuristically assume that the number of times v is closed is a Poisson random variable, 
so the probability that v is open should be 

p = p(t.(i)) := e"3* 2 . 

Also, we assume that most small sets W of vertices behave independently (i.e. the probability 
they are all open is p\ w \). Thus we make the following predictions: 

Q ~ 2 np E 3 w 2 n 2 p 3 E 2 ft: 3 n 3 ^ 2 tp 2 

D 3 ,l{v) « 2np 2 D 3 ' R (v) « np 2 D 2 , L (v) w 2n 1/2 tp D 2 jR (v) « n 1/2 tp 

However, there are certain properties of 'H (namely the codegrees) that will force us to 
take a bit more care with certain elements. The elements v and — v are correlated, because 
v + b = c ■<=> —v + c=b. In other words, H has a lot of edges {u, b, c} that have 
corresponding edges {— v,b, c}. Thus, if — v is chosen, then a lot of edges containing v are 
removed from the hypergraph, in particular D 2 ^l(v) = 0. Also, even when — v ^ S, there 
may be many ways to close v that would simultaneously close —v, so we do not expect v 
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and —v to behave independently. With some care, though, we can anticipate what the joint 
distribution ought to be. 

TL has about 2 n edges {v, b, c} with corresponding edges {— v, b, c}, about n edges {u, 6, c} 
without any corresponding edge {— v, 6, c}, and about n edges {— v,b, c} without any corre¬ 
sponding edge {u, b , c}. Thus, the expected number of edges closing either of v, —v is roughly 
t 2 , and the probability that both v, —v are open is e -t = p 4 ^ 3 . Thus we make a prediction: 

D 3 ,r( 0) « np 4/3 


If — v is chosen, then it becomes more likely that v will stay open for longer. Indeed, since 
—v is chosen we do not have to worry about v ever being closed by any of the corresponding 
edges containing —v. In this situation, v can only be closed by its other n edges, so following 
a similar calculation the probability that v is open given that —v is chosen ought to be 
e"4 4 =p 1 / 3 . Thus we predict 

£> 2 ,fl( 0 ) « n 1/2 tp 1/3 


Finally we predict the value of D i.h( 0). At each step, the variable £>1^(0) either stays 
the same or increases by 1. The probability of increasing is 


£>2,fl(0) 

Q 


1 _ 1 _ 2 
—n 2 tp 3 
2 


integrating the above expression gives the prediction: 


-Di.fl(O) ~ 2 (p 3 “ x ) • 

The variables C(v) for v ^ 0 will be much smaller, and we will prove that later. 

We finish this section with a general conjecture about the random greedy algorithm for 
independent sets in hypergraphs. This conjecture ought to apply to all sufficiently ’’nice” 
(almost uniform, almost regular, not too sparse) hypergraphs. In general, the algorithm 
should terminate soon after the number of open elements is negligible compared to the number 
we’ve already chosen. Our conjecture is that we can predict when this happens using the 
heuristically derived trajectory for Q. 

Conjecture 3.1. The step i at which the random greedy independent set algorithm terminates 
w.h.p. is asymptotically the value i such that Q « i. In particular, the sum-free process 
terminates when 2 np « n 3 ^ 2 t, so when 



Even proving the lower bound of the conjecture presents considerable difficulty, partic¬ 
ularly due to the fact that we suspect I?i : _r( 0) becomes larger than D 2 (v ) by that time. 


4 Proof Overview 

We appeal to the usual differential equations method to establish dynamic concentration of 
the random variables around the trajectories we heuristically derived. See [l2j for an intro¬ 
duction to the standard method. We augment the standard method so as to take advantage 
of the self-correcting nature of some of the variables. For each variable V and each bound 
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(i.e. upper and lower) we introduce a critical interval Iy = [ay, by]. This interval varies with 
time and has one endpoint at the bound we are trying to establish with the other slightly 
closer to the expected trajectory. We only track V if and when it enters a critical interval. 
If V enters the critical interval at step j we ‘start’ observing a sequence of random variables 
that is designed to be a sub- or supermartingale with the property that if V eventually passes 
all the way through the interval (and thereby violates the bound in question) then this mar¬ 
tingale has a large variation. When working with the lower bounds we consider the sequence 
V — ay. This sequence should be a submartingale with initial value (i.e. value at step j ) 
roughly by — ay. Note that this sequence becomes negative when the bound in question is 
violated. Similarly, when working with the upper bound we consider the sequence by — V, 
which should be a supermartingale. The event that V ever violates one of the stated bounds 
is then the union over all ‘starting’ points j of the event that one of the martingales that start 
at this point has a large variation. We prove Theorem 11.31 by an application of the union 
bound, taking the union over all variables V and all ‘starting’ points for both the upper and 
lower bounds. 

The reason that we focus our attention on these critical intervals is the fact that the 
expected one-step changes in the variables we consider have self-correcting terms. These 
terms introduce a drift back toward the expected trajectory when V is far from the expected 
trajectory. By restricting our attention to the critical intervals we make full use of these 
terms. In some cases, we manage to prove error bounds that actually decrease as the process 
evolves (i.e. we prove that the variable stays within a window of decreasing width) See 
m and [7] for early applications of this self-correcting phenomenon in applications of the 
differential equations method for proving dynamic concentration. As we noted above, the 
critical interval method we use here was introduced in (4J. 

Another important feature of the proof is that we track some “global” variables, namely 
E 2 and E 3 . One can produce a shorter proof of Theorem 11.31 for some io that is smaller by 
a constant factor, without tracking E 2 or E 3 . We now (very roughly) describe how tracking 
E 2 , E 3 benefits our analysis. We use our control over the local variables (i.e. variables of the 
form D-,-(v)) and Lemma Hu2l to get amplified control over the global variables. This helps 
us track Q since the expected one-step change in Q can be written in terms of E 2 - Doing so 
is much better than writing the expected one-step change in Q in terms of the local variables, 
which have larger relative errors than the global ones. 

For an arbitrary random variable V we define 

A V(i) = V{i + 1) - V(i). 

We let Ti be the filtration of the probability space given by the first i edges chosen by the 
random greedy matching process. 

1 ( log log n\ 1 1 

Define io := —p= 1 — 20 • — ; - n 2 log 2 n. Note that we have 

V3 V log n J 

p(i(io)) = 7 W 1 / 4 log 10+o « n. 

Define the stopping time T as the minimum of io and the first step i that any of the following 
bounds fail: 


6 



\D 3 ,l(v) ~ 2np 2 \ < 2f d3 Vi> ^ iS'(i) U {0} 
D 2>L (v) - 2n*tp <2f d2 Vv £ ±S(i) U {0} 

I A3,_r( 0) — np 4 / 3 1 < f d3>0 


\E 3 - jQ 3 n 4 | < /e3 


Di, l {v), D 1>r (v) < log 2 n Vu^O 
I Q - 2np| < f q 


D 3 ,r( v ) ~ n P 2 \ < fd3 Vu ^ S(i) U {0} 
D 2 ,r(v) — n^tp < f d 2 Vu ±S(i) U {0} 


^2,fl(0) - n 1 / 2 ip 1/3 


< fd2,0 


E 2 --Q z n 2 t 


< /e2 


A,*(0) - \ (p-f - l) 


< fdl,0 


Theorem ll.3l is proved in sections 6-11. We will show that, for specific choices of the error 
functions /, that the stopping time T = i 0 with high probability. This dynamic concentration 
result will in turn imply that the algorithm produces a set of size at least *o with high 
probability. As mentioned above, for most of the variables we use critical intervals to prove 
the error bounds. So these variables V each have two functions fy and gy. fy is the farthest 
V can ever be from its trajectory, and the width of the critical interval is fy — gy. In other 
words, we start paying attention to V only when its distance from its trajectory becomes 
as large as gy. These error functions must be carefully chosen to satisfy certain inequalities 
(“variation equations”), which arise from calculations in subsequent sections. If the reader 
wishes to look ahead at the variation equations, they are on lines 0, @, (ED. m , ED, 

ED , ED, and ED- 


f d 2 = ni log 3 n (l + 3f + 2 1 2 + 31 3 ) 
f d 2 ,o = P~%n* log 3 n(l + 2t + 2t 2 + 2t 3 ) 
fd 3 = pn^ log 3 n (l + At + 7 1 2 ) 
fd 3 ,o = ni log 3 n (l + 3t + 5t 2 ) pi 
f e2 = n log 6 n (1 + 20 1 + 701 2 + 130t 3 + 100f 4 + 120t 5 ) 
f e3 = pni log 6 n (l + 20t + 70t 2 + 160t 3 + 150t 4 ) 
f q = p~ l ni log 6 n (1 + 2t + 10 1 2 + 35 1 3 + 40t 4 ) 
fd i,o = p~inri log 3 n (2 + 2 1 + 2 1 2 ) +p~i logn 


gd 2 = fd 2 ~ni log 3 n 

— — — 3 

9d2,o = Jd 2 ,o ~ P 3 n 4 log n 
gd 3 = fd 3 - pni log 3 n 

1 3 q 

gd3,0 = fd3,0 p 3 n 4 log n 
g e 2 = fe 2 - n log 6 n 
g e 3 = fe 3 -pni log 6 n 
9q = fq — P~ 3 ni log 6 n 


A note on checking the variation equations: they can be verified in a straightforward 
(though perhaps a bit tedious) manner as follows. On the left side of the inequality, plug in 
the values of all of the functions. There will be a common power of n, a power of logn and a 
power of p that can be factored out of everything, leaving a factor which is just a polynomial 
in t. That polynomial will have no positive coefficients and so we just bound it using its 
constant term (which is valid since t > 0). For example, to verify (ED, note that 


— 6n 2 tgd 3 + 0 pfd 2 — 2 n 2 f' d3 =pn i log 3 n ■ (—2 —7 1 — 3 1 3 ) < —2pn 4 log 3 n 
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5 Preliminaries 


In this section we prove several bounds that are necessary for our calculations. Most of these 
bounds are used to justify big-0 terms later. 

For v ^ 0, we anticipate D 2 (v) « D 2L {v) + D 2 R (v). But to make this estimate 
valid we need to bound the number of q £ Q such that q ^ v and there are two edges 
ei,e 2 £ D 2 > l(v) U D 2 ^r(v) such that q £ ei,e 2 - Note that v is only in 0(1) many edges 
of size 2, so let us assume |ei| = |e 2 1 = 3. In other words, for some si,S 2 £ S' we have 
e-j = {v,q,Sj}. There are cases to consider, according to how the corresponding equations 
are arranged. Without loss of generality we have one of the following cases: 

1. v + Si = q and v + q = s 2 . Then Si + S 2 = 2u (so we have {si, S 2 } £ Di A 2v ))- 

2. v + si = q and q + s 2 = v. Then si + S 2 = 0 (so we have {si, S 2 } £ D\ ,.r(0))- 

Thus, before stopping time T we have 

D 2 {v) = D 2 , l (v) + D 2tR {v) - 0(1 + D hR (2v) + Di tR ( 0)) (4) 

We now show that if v ^ 0 then v does not get closed too many times. Note that Di^{v) 
only increases in size on steps when we choose an element of D 2j l(v), and on such steps 
Di t L(v) can increase by at most 2. Before the stopping time T, we have that D *'q V - < 2 n~^t,. 

Thus, C(v, i) is stochastically dominated by 2 R where R ~ Bi(i , 2 n~^ log 5 n). An application 
of the Chernoff bound then tells us that R does not get bigger than log 2 n w.h.p., and thus 
the stopping time T does not happen due to the condition on Di^{v). Bounding Di R {v ) is 
similar. 

We now bound the size of D 2 (v 1 ) D D 2 (v 2 ), for Vi,v 2 ^ 0 ,Vi ^ ±v 2 - Again, each of 
Vi,v 2 is only in 0(1) many edges of size 2, so we assume edges have size 3. Then for each 
q £ D 2 {v 1 ) fl D 2 {v 2 ) there is a pair Si, S 2 G S such that both {ui, si, q} and {^ 2 , s 2 , q} are in 
E. There are cases to consider according to how each of the equations is arranged, but in each 
case we reach one of the following conclusions: {iq + v 2 , s 1 , S 2 } £ E, {iq — v 2 , si, s 2 } £ E, or 
{v 2 — Vi,si, S2} £ E. Thus, by our bounds on the sizes of sets of the form Di l and Di R we 
have, for v\, v 2 ^ 0, Vi ^ ±v 2 that 

\D 2 (vi) n D 2 (v 2 ) I = 0(log 2 n) (5) 


To finish this section, we present a couple of lemmas which we will use several times to 
estimate things. The following lemma will be used to estimate fractions based on estimates 
of the numerator and denominator. 

Lemma 5.1. For any real numbers x,y,e x ,e y , if we have x,y ^ 0 and \^-\ , y < |, then 


x + e x x 

y + e v V 


ye x - xe v 



+ 0 




Proof. 


x + e x 
y + e v 



- 1 



1 


□ 


The next lemma is used to estimate global parameters (e.g. the total number of edges) 
based on estimates of local parameters (e.g. vertex degrees). 

Lemma 5.2. Suppose ( xf)i^i and (; yf)iei are rea l numbers such that \xi — x\ < 5 and |j/j — y\ < 
e for all i € I. Then we have 


y^Xiyi 

iei 



< 2\I\6e 


Proof. The triangle inequality gives 

y2( Xi ~ x ^ Vi ~ v ^> 


iei 


< \I\6t. 


Rearranging this inequality gives 

y~l xpyt = X Vi + y Xj - \I\xy ± \I\Se 


iei 


iei iei 


Mg*) 


Ul 


yy,vi -y ) ± we. 


iei 


□ 


6 Tracking the variables 

In this section we prove dynamic concentration of the D 2 -type variables. We start with 
T> 2 ,l{v) for v ^ 0. 

For any fixed index j < i 0 we define the stopping time Tj as the minimum of the following 
indices: T, j and the smallest index i > j such that d? 2 ,L(^ 5 *) is not in the critical interval 

2 nh.p + 2 g d2 , 2 nhp + 2 f d2 

We need to estimate E[AD 2 ,L(v)\J 7 i] for j < i < Tj. The one step change A D 2 ,l{v) 
has both positive and negative contributions. D 2 ,l{v) gains a pair {v,q} when we choose 
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an element q' such that {v,q,q'} £ D 3 l(v), except in the case where we happen to have 
q £ D 2 (q') (in which case we may conclude that either 2q' or 2 q is in D 2 (v)). D 2 ^{v) loses 
a pair {i>, 6} when b is chosen or closed. 

Thus we can put 


E[AD 2 , L {v)\Fi\ = ^ < 2 D 3 , l (v) + 0(n 2 tp) - ^ D 2 (q) 

{ {v,q,s}£D 2 ,L{v) 

In the sum above, it is intended that q £ Q,s £ S, so we are summing all the ways to 
lose an edge in {u, q, s} £ D 2 ^{v) due to closing q. The big-0 term absorbs the error due 
to: the possibility that we choose some q £ {v,q,q'} £ D 3 ^{v) such that q' £ D 2 {q ); the 
possibility of losing {t;, q, s} due to choosing q ; and the effect of the 0(1) many edges of size 
2 in D 2 ,l{v). 

Almost all of the terms in the sum are D 2 (q) « D 2tR (q) + D 2iR (q) « 3 n^tp, by 0) and 
our estimates on the variables we’re tracking. However, we may have some terms with q such 
that — q £ S , in which case D 2t L{q) = 0. However, supposing that {v,q,s} £ D 2 ^l(v) and 
—q £ S , we conclude (by considering cases) that either {v, s, —q} £ Di iR (v) or {— v, s , —q} £ 
Di,r(—v). Thus there are 0(log 2 n) many such terms in the sum. Now applying 0, our 
estimate for Q , lemma 150 and recalling t = ©(log 1 / 2 n) we get 


E[AD 2)L {v)\J : i] - 2p+ -n 2 tD 2}L (v) 


<2 n x p 1 fd3 + 3n 2 tf d2 


+0 (n Mog nf q + n 1 p 1 f d2 + n = log 3 n + n 3 \ognD ltR 
= 2n~ 1 p~ 1 f d3 + 3 n~hf d2 + O (n~% log 1 


” 8 np 1 


and if we use the fact that D 2 < l(v) is in the critical interval, we get 


E[AD 2tL (v)\Ti] <2 p- 3 t 2 p + 2 n 1 p 1 f d3 + 3n h(f d2 - g d2 ) 

+0 [n~ 2 log 8 np -1 ^ (6) 

For each vertex v ^ 0 , we define the sequence of random variables 

n+ (v A •= / £> 2,L(w,*)-2n3fp-2/ d2 :±v<£S(i) 

2,i ’ \ D 2 L (v,i — 1) -.otherwise 

We will show that the sequence of random variables D^ L (v, j )... D^ L (v, Tj ) is a supermartin¬ 
gale, and then use a deviation inequality to show that w.h.p. L {v,i) is never positive (and 
hence D 2 ^{v,i) does not violate its upper bound). For j <i < Tj 

E\A D 2 l ( v ) \J~i\ < 2 n~ 1 p~ 1 f d3 + 3 n~h(f d2 - g d2 ) - 2n~?f' d2 

+0 (n~? log 8 np" 1 + n -1 /rf 2 ) 

< — Q [it,-* log 3 n'j 


10 




Note that we have used ©, and approximated the 1-step change of deterministic functions 
using derivatives (i.e. Taylor’s theorem). The last line can be verified by observing that 


2 n V 1 fd 3 + Zn n(f d 2 - gd 2 ) - 2n 2 f' d2 < -n * log 3 n 


(7) 


Now we use Azuma-Hoeffding to bound the probability that the supermartingale 
D£l( v > j) ■ ■ ■ s t ra y s too far. 

Lemma 6.1. Let Xj be a supermartingale, with |AXj| < Ci for all i. Then 


P(X m 


/ 

Xq > a) < exp 

V 


\ 


i<.m 



We now bound the largest possible 1-step changes in the supermartingale. By examining 
all the contributions (positive and negative) we see that the largest possible 1-step change is 
a large negative contribution due to the algorithm choosing some vertex v' such that D 2 {v') 
contains a lot of vertices q in an edge {t>, q , s} £ T , 2 ,l('c)- Using (O to bound the number of 
such q (for any fixed v r ) we get 



0(log 2 n) 


Thus, if D 2 ,l(v, i) crosses its upper boundary at the stopping time T, then there is some step 
j (with T = Tj) such that 


D 2,l( v >3) < - 2 (/d 2 (i(j)) - 9 d 2 {t{j))) + 0(log 2 n) 


and L (v,Tj) > 0. Applying Azuma-Hoeffding we see that the event that the supermartin¬ 
gale D£ l (v) has such a large upward deviation has probability at most 


exp 



f (n* log 3 n) 2 \ 
V *o(log 2 n) 2 J 


= o(n 2 ) 


3 1 

As there are at most 0(n 5 log 5 n ) such supermartingales, with high probability none of them 
have such a large upward deviation, and D 2 : l{v) stays below its upper bound for all v. 

The lower bound for Z? 2 ,l(^) is similar, as are the bounds for D- 2 ,r(v) for v ^ 0. 

Now we address D 2! r{0). For any fixed index j < ig we define the stopping time Tj as 
the minimum of T, j and the smallest index i> j such that D 2j R(0,i) is not in the critical 
interval 

n 5 tp3 +g d2fi ,n^tp3 + f d2y0 . 


Now 


S[A£> 2 , fl (0)|7i] = - 


2Zl3,fl(0) + 0(n 2 tps) - 


D 2 (q) 


{0,q,s}£D 2 , R (0) 


and note that for every {0, q, s} £ D 2 _r(0) we have that —q = s&S and so D 2 (q) = D 2 
Thus 
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( 8 ) 


E[AD 2 ,R(0)\J r i] -p3 + in hD 2 ,R(0) 

^ ~l 1 2 / -t 2 \ 

< n~ 1 p~ 1 f d 3,0 + -n _ 3 tp _ 3 /d 2 + O ^n _ 1 p _3 logn/ g + n~ 1 p~ 1 f d2y0 f d2 J 
= n~ 1 p~ 1 f d 3,0 + \n~^tp~i f d2 + O (n~?p~% log 8 nj 
We define the sequence of random variables 

Dt^,i) ■= D 2 ,r( 0, i) - nhp* - f d2 ,o 
We will show that the sequence D 2R (0, j)... R ( 0, Tj) is a supermartingale. For j < i < Tj 


E 




(0, i) |*7q < ~n hg d2 $ + n l p 1 f d3 ,o + ^ri hp % f d2 - n 2 f' d2 


,o 


+o 


p 3 log 8 n + 


< -Cl 


(n 2 
log 3 n- p-i'j 


i-'fko) 


Note that in the first line we have used © and the fact that D 2 R (0) is in the critical interval. 
The last line can be verified by observing that 



Again, the biggest possible 1-step changes in the above supermartingale comes from the 
intersections of D 2 sets. Using 


AD+ R (0,i) 


0(log 2 n) 


Thus, if £> 2 ,.r(0, i) crosses its upper boundary at the stopping time T, then there is some step 
j (with T = Tj ) such that 


D 2,r( °.J) < -(fd2,o(t(j)) ~ 9d2,o(t(j))) +log 2 n 


and D 2 R (0,Tj) > 0. Applying Azuma-Hoeffding we see that the event that the supermartin¬ 
gale D^ R (0,i) has such a large upward deviation has probability 


exp 



/ (n‘ log 3 np 3 ) 2 
\ *o(log 2 n) 2 


°( n T 


As there are at most 0(n 5 log 3 n) such supermartingales, with high probability none of them 
have such large upward deviations, and D 2 R (0) stays below its upper bound. Proving the 
lower bound for D 2 R (0) is similar. 


7 Tracking the D :i variables 

The variable D 3tR (v) is nonincreasing, and we lose a triple {u, q , q'} £ D 3< l(v) whenever q or 
q' is closed or chosen. Thus we have 

E[AD 3tL (v)\Ti} = -^ J2 D 2 (q)UD 2 (q')U{q,q'} 

{v,q,q'}eD 3 tL (v) 
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Before the stopping time T, we can estimate the above expression. Most of the terms in the 
sum will have —q,—q' ^ S. Supposing that {v,q, q'} £ D 3 j l(v) and — q £ S , we conclude 

that one of q' or —q' must be in /^(w). Thus there are O ^n^tp^J many such terms in the 
sum. Now applying ([4]) , our control on the variables before the stopping time T, lemma [5TT1 
and recalling t = 0 ( log 1 / 2 n) we get 


E[AD 3 tL (v)\J r i ] + 3n 2 tD 3 , L (v) 


< 6pfd2 


+o 


(pDiAO) 


-m-i 


+ p log n + n 2 tpfq + n p 


fd2fd3^ 


= 6 pfd 2 + o (log 8 n) 


( 10 ) 


For a fixed index j, and a fixed element v ^ ±S(j) , we define the sequence of random 
variables D^ L {v){j)... D^ L (y){Tj), where 


n + , .x / D 3tL ( v ’ *) “ 2 n P 2 - 2/^3 : v £ ±S(i) 

3 | Df L (v,i— 1) -.otherwise 

and the stopping time Tj is the minimum of T, j and the smallest index i such that D 3 ^l{v, i) 
is not in the critical interval 


[2np 2 + 2 g d3 , 2 np 2 + 2 f d3 ] . 

We will show that the sequences D 3 < L{v,j) ■ ■. D 3 t L(v,Tj) are supermartingales. For j < i < 
Tj, we have the inequality 


E 


AD 3,l( v i OI^J < ~ 6n 2 t9d3 + 6pfd2 - 2 n 2 f ' d 3 

+0 (logNz + n -1 /^) 


< -fi 


(pn 4 log 3 nj 


Note that in the first line we have used m and the fact that D 3 > l(v) is in the critical interval. 
The last line can be verified by observing that 


- 6 n 2 tg d3 + 6 pf d 2 ~ 2 n 2 f d3 < -2 pn* log 3 n 


( 11 ) 


Now we use Azuma-Hoeffding to bound the probability that the supermartingale strays 
too far. By considering the total number of elements that get closed in any one step, we see 


AD Il ( v >*) 


< O 


(ji 2 tp^j 


Thus, if D 3 ! l(v, i ) crosses its upper boundary at the stopping time T, then there is some step 
j (with T = Tj) such that 


D 3 L {v,j) < -2 (f d3 - g d3 ) + O (n*tpj 

and D^ l (v, Tj) > 0. Applying Azuma-Hoeffding we see that the event that the supermartin¬ 
gale L (v,i) has such a large upward deviation has probability 
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exp 


°{n 2 ) 


-n 


( 

(n* log 3 npy 

Y,(nhp ) 2 

\ i<io 


\ 


(1 + 0 ( 1 )) 


/ 


3 1 

As there are at most Oyn 2 log 5 n) such supermartingales, with high probability none of them 
have such large upward deviations, and D 3 R (y) stays below its upper bound for all v. 

The lower bound for D 3 jJv) is similar, as are the bounds for D 3 y(v) for « / 0. 

Now we address D 3 , R (0). Using the same methods to estimate E[AD 3 yL (v)\ J 7 ,;] before 
the stopping time T, we get 


B[A£> 3l fl(0)|7i] = -^ ^ D 2 (q)UD 2 (-q)U{q,-q} 

{0,q,-q}£D 3 , R (0) 

but note that each term in the sum has D 2 y(q) = D 2 y(—q) and so the term is roughly 
4n 1/,2 tp. Thus 


E[AD 3tR (0)\Fi] + 2n~ 5 fD 3j fl(0) 

< 2pif d 2 + O (p^Diy(0) + n~^p^f q + n~ 1 p~ 1 fd2fd3,o) 

= 2 p^ f d2 + O (p~% log 8 n) (12) 

We define the sequence of random variables 

-£>3^(0, i) ■■= D 3 tR (0, i) - nhpi - f d 3 ,o- 

Now for any fixed index j < io we define the stopping time Tj as the minimum of T, j and 
the smallest index i> j such that D 3 tR (0, i) is not in the critical interval 

i 4 14 ' 

n 2 tp3 + gd3,o,n 2 tp3 + } d 3,o 

We will show that the sequence D 3 R (0,j) ■ ■ ■ ^ 3R (0, Tj) is a supermartingale. For j < i < Tj 


E 


-3 f' 

<23,0 


< - 


AD+ R (0,i)\Ti <-2 n hg d 3 ,o + 2p* f d2 - n 5 /, 
+0 [p~i log 8 n + n _1 /^ 3) 
fl (p^ni log 3 nj 


Note that in the first line we have used m and the fact that D 3 R (0) is in the critical 
interval. The last line can be verified by observing that 


- 2 n 2 tg d 3,0 + 2 ps fd 2 ~n 2 f d30 < -p^n* log 3 n. 


(13) 


Now we use Azuma-Hoeffding to bound the probability that the supermartingale strays 
too far. We have 
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&D+ R (0,j) 


< O 


(n 2 tp'j 


Thus, if D 3 . r { 0, i ) crosses its upper boundary at the stopping time T, then there is some step 
j (with T = Tj ) such that 


^ ~(fd3,o - 9d3,o) + O (nhp'j 


and D+ r (Q. Tj) > 0. Applying Azuma-Hoeffding we see that the event that the supermartin¬ 
gale R ( 0 ) has such a large upward deviation has probability o(n _1 ). As there are at most 

0 {n^ log^ n) such supermartingales, with high probability none of them have such large up¬ 
ward deviations, and D 3 <R {0) stays below its upper bound. The lower bound for £> 3 ^( 0 ) is 
similar. 


8 Tracking the E 2 variable 

We have the formula 



D 3 ,L(q) + D 3 tR (q) + 0{nhp) 


E D 2 tL {q') + D 2 ,R(q') 

q'£D 2 (q) 


1 

Q 




qGQ 


+ 0 (pn 2 logn - 


log 2 n 


+ p-*) 


We apply to the sum EqeQ ^2 (<?) 2 , except we have to be careful because there 
are a few terms that are significantly smaller than others, namely the terms corresponding 
to q € — S(i). Using simple bounds on the number of such terms, we have 


E 

qeQ 




18Q/J2 + 0 


and 


E[AE 2 \Ei\ 


3 E 3 \E\ 

Q Q 2 


< I 8 /J 2 + 0 \pn 2 log 2 n - 


■ log 2 n 


(14) 


For a fixed index j, we define the sequence of random variables E£(j)... E%{Tj), where 
E+(i) :=E 2 (i)-^Q(i) 2 n-h-f e2 

and the stopping time Tj is the minimum of T, j and the smallest index i such that E 2 is 
not in the critical interval 


3 1 3 1 

-Q{i) 2 n~^t + g e2 , -Q{i) 2 n~^t + f e2 
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We will choose the functions g e 2 ,fe 2 so that these are supermartingales. We have the 
inequality 


E [A (. E+(i )) |7i] 

3^3 4if.2 2 3 —1 3 _ I / 2£/2 , 

< TT - TIT + 18 /d2 - 7 n Q -~A n 2t Q( —7T )~ n 2 ft 


Q Q 


Q 


le 2 


+0 (pn 2 log 2 n + log 2 n + p 2 +n 1 /" 2 j 


3 

Q 


E 3 ~ ~-n 1 Q 3 I - 


4£ 2 

Q 2 


E 2 --n HQ ) + 18 f d2 -n 2 f e2 


+O (pn 2 log 2 n + log 2 n + p 2 +n 1 


< 2 n P 1 fe3 ~ 3tn 2 g e2 + 18/J 2 — n 2 f' e2 


+0 


< -n 


(pn 2 log 2 
(n 2 log 6 nj 


n + log 2 n + p 5 +n 1 f "2 + n 2 P z fe 3 fq + n 2 p 2 / 2 2 + n 2 tp z f q f e2 


- 2-2 


- 2„-2 


— — 1 —2 


Note that in the first line we have used m- The second line sheds some light on 
the reason why we track E 2 in terms of Q instead of a completely deterministic function. 
By doing so, we arrive at a supermartingale calculation in which main terms neatly group 
together and cancel, leaving only a few small error terms behind. The last line can be verified 
by observing that 

^n~ 1 p~ 1 f e 3 - 3 tn~?g e2 + 18 fj 2 - n~if e2 < log 6 n (15) 


Now we use Azuma-Hoeffding to bound the probability that the supermartingale strays 
too far. We have 


\&E+(io,j)\ < 0(n 2 tpf d2 ) = 0(n* log 5 n ■ p) 

Thus, if E 2 (i) crosses its upper boundary at the stopping time T, then there is some step j 
(with T = Tj) such that 


EtU) < ~(/e 2 - g e 2 ) + 0 {ni log 5 n • p) 

and E 2 (Tj) > 0. Applying Azuma-Hoeffding we see that the event that the supermartingale 
E^ has such a large upward deviation has probability at most 


exp 


( 


-n 


(n log 6 


y (n 4 log 5 n ■ p) z 

\^o 


o(n Y 


As there are at most 0(n 2 log^ n) such supermartingales, with high probability none of them 
have such large upward deviations, and E 2 stays below its upper bound. The lower bound 
for E 2 is similar. 
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9 Tracking the E% variable 


We have the formula 


E[AE 3 \JFi} 


1 

Q 


E 

q&Q 


D 3 ,L{q) + D 3 ^ R (q) + E (D 3 ^(q') + D 3 tR (q')) 

q'£D 2 (q) 


1 


Q 


E £>2 ^) • (■ D 3 >L (q ) + D 3 , R (q)) + 0{np 2 ) 
qeQ 


We will apply (15.21) to the sum J^veQ Aa(v)' D 3 ( y ), and again we must be careful because a 
few of the terms are smaller than others. We arrive at 


E D ^ v ) ■ A>(w) € ± 18Q/ d2 /d3 + o (: nHV) 

v<=Q ** 

And so 


< 18/d2/d3 + O ( np 2 log n) (16) 

For a fixed index j, we define the sequence of random variables (j)... ( Tj ), where 

E+(i) := E 3 (i) - ^Q(i) 3 n _I - f e3 

and the stopping time Tj is the minimum of T, j and the smallest index i such that E 3 is 
not in the critical interval 

i<3(i) 3 n -1 +g e3 , -Q(i) 3 n _1 + f e3 ■ 

We will choose the function g e3 , f e3 so that these are supermartingales. We have the inequality 


E[AE 3 \T t \ + 


6E 3 E 2 

~Q 2 ~ 


E [AE+(i)\Ti\ 

6E 3 E 2 1 -i 

< -^-f- 18/d2/d3 - ^ri 

+0 (np 2 log n + n _1 /" 3 ) 
9 i 

<--n 2 tg e3 + 18f d2 fd 3 - n 
+0 (n~ 2 p ~ 2 f e 3 f e2 + n~ 

< — fi (nlog 6 n ■ p) 




h f'e 3 

’ P~ 1 fe 3 fq + np 2 log n + n 



Note that in the first line we have used (USD- Similarly to the analogous calculation for 
E 2 , we were able to group main terms together and cancel them. The last line can be verified 
by observing that 

- ^n~hg e3 + 18f d2 f d3 - n~? f e3 < -2pnlog 6 n (17) 
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Now we use Azuma-Hoeffding to bound the probability that the supermartingale strays 
too far. We have 


| AE^ (i )| < 0 (np 2 f d 2 +nhpf d 3 ) = 0 (n ° log= n-p 2 ). 

Thus, if £3 (i) crosses its upper boundary at the stopping time T, then there is some step j 
(with T = Tj) such that 

E%(j) < ~{fe 3 - ge 3) + 0 (ni log^ np 2 ) 

and E£(Tj) > 0. Applying Azuma-Hoeffding we see that the event that the supermartingale 
E£ has such a large upward deviation has probability at most 


exp < — 


( ( I, 6 \ 2 \ 

( n 2 log n • p 1 

eI 

i<io 


n 4 log 2 n • p 


O' 


/ 


= o(n x ) 


As there are at most 0{n 3 log^ n) such supermartingales, with high probability none of them 
have such large upward deviations, and £3 stays below its upper bound. The lower bound 
for £3 is similar. 


10 Tracking the Q variable 

We have 


E[AQ\Ti} = -1 - ^ ^2 D 2 (q ) = + 0(log 2 n + p 3 ) 


96 <3 


Q 


For a fixed index j , we define the sequence of random variables Q + (j )... Q + (Tj ), where 


Q + (f) := Q(i) - 2 np- f q 

and the stopping time Tj is the minimum of T, j and the smallest index i such that Q is not 
in the critical interval 

[ 2 np + g qi 2 np + f q \. 


We will choose the function g q , f q so that these are supermartingales. We have the inequality 


E[A(Q- 2 np- f q ) | Ti] < + 3nhp-n~^f' q + 0(log 2 n+p~i +n~ 1 f q ) 

< Qn~h+ + 3nhp- nT^fq + 0(\og 2 n + p~i +n~ 1 f q ) 

< n~ 1 p~ 1 f e2 - ^n~hg q - n~^ f' g + O + log 2 n + p~ i +n -1 /"^ 

< -n (log 6 n-p- 1 ) 

Note that in the first line we have used m- The last line can be verified by observing that 
n~ 1 p~ 1 f e 2 - ^n~hg q - n~i f q < ~p~ l log 6 n. (18) 
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Now we use Azuma-Hoeffding to bound the probability that the supermartingale strays 
too far. We have 


|AQ + (io, j)\ < 0 {fd 2 ) = 0 (n i log® n). 

Thus, if Q{i) crosses its upper boundary at the stopping time T, then there is some step j 
(with T = Tj) such that 


Q + (j ) < -{f q ~ 9g ) + 0(n * log® n) 

and Q + (Tj) > 0. Applying Azuma-Hoeffding we see that the event that the supermartingale 
Q + has such a large upward deviation has probability 


exp 


-Cl 



= °(n ) 


As there are at most 0(n s log= n ) such supermartingales, with high probability none of them 
have such large upward deviations, and Q stays below its upper bound. The lower bound for 
Q is similar. 


11 Di } r(0) 

We have the expected 1-step change 


and so before T, 


E[AD liR {0)\Ti\ 


£>2,fl(0) 

Q 


E[D ltR (0)\Ti\ - n hp » 

< -n -1 p -1 / d 2,0 + O [n~ip~i log nf^j 
We dehne the sequence of random variables 

C+(0,i) := C(0,i) - i (p~i - l) - h dl , o 

where 

hdi,o '■= log 3 n(2 + 2 t + 2 t 2 )p~i. 

we have 

E[AC + (0,i)\T,] 

< 2 ,o - n~?ti dlfi 

+0 (n~ip~i log nf q + n _1 /i' d ' 10 ) 

< —fl log 3 rap - s') 
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The last line can be verified by observing that 


1 1 3 3 5 

-n~ 1 p~ 1 f d2fi ~ n~*ti dlfi < — -n _ 3 log 3 np ~ 3 . 


(19) 


2 J ’ ai ’ u “ 2 
We will apply the following inequality due to Freedman j5]. 

Lemma 11 . 1 . Let Xi be a supermartingale, with AX\ < C for alii, andV(i) := V ar[AX (k)\J-k] 

Then 

P[3i :V (z) < v, Xi — Xq > d] < exp ( — 


k<i 


2(v + Cd) J 


For our application of this inequality, we can use (7 = 1. To determine a suitable value 
for v, note first that before T we have 


Var[AC+(0,k)\X k ] = V ar[AC (0, k)\P k \ < E [(AC(0,fc)) 2 \T k 
D2,R{0,k) 


Q{k) 


<n 2 t(k)p{t(k )) 3 


so we bound the sum 


2 t(k)p(t(k)) 3 < 

k<i 


rt(i) 


rp(r) 3 dr<p(t(i))~ 


(note that tp 2 ^ is increasing in t so we may bound the sum with an integral) so we set 
v = p(t(i ))~3 . Now we see by Lemma fl 1. ll that with d = p(t(i))~ 3 logn, with high probability 
the supermartingale D+ R (0,i) is no larger than d. Therefore before T we have the upper 
bound 


Di,r(0,i) < i (p 3 - l) + h dlfi +p(t(i)) 3 log n. 

The lower bound for -Di.#(0,z) is similar. 

In particular, T>i,_r(0,z 0 ) = \p{t(i 0 ))~i (l+o(l)) = 0 (n 1/6 ) , while D^r{v, z 0 ), Hi,l(u,zo) = 
O (log 2 n) for all v ^ 0. The behavior of £> 1 ^( 0 ), and the fact that £> 1 ^( 0 ) appears in many 
of our big -0 terms, would seem to indicate that some new ideas would be needed to track 
the sum-free process much further (e.g. to prove Conjecture 13. IF 
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discussions 
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